
Turbine Tank Turbine Changes #437

Merged
merged 24 commits into from
Feb 29, 2024

Conversation

Contributor

@saienduri saienduri commented Feb 14, 2024

This commit adds the implementation for turbine tank. It currently lets us run llama, sd models, and 30 other models e2e and upload model artifacts to Azure. A folder has already been uploaded to the Azure tankturbine storage container using turbine tank, if you are interested in the structure and in how we track versions using the date + git_sha: container_link. The torch IR for every model is uploaded when the user chooses to upload using turbine tank. Accuracy against a torch run is also tested for every model we can run e2e, which lets us track the accuracy of our models and avoid regressions. There is also an option to download turbine tank model artifacts locally, so you don't have to run the models yourself; it will not redownload when cached artifacts are already present, and it will refresh local artifacts that are out of date. Further detail can be found in the code comments.
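As a rough illustration of the date + git_sha versioning scheme described above (the function names and folder layout here are assumptions for illustration, not the PR's actual code):

```python
# Hypothetical sketch of date + git_sha versioning; names and layout
# are illustrative assumptions, not the PR's actual implementation.
from datetime import date


def tank_version_prefix(git_sha: str, day: date) -> str:
    """Build an Azure folder name like '2024-02-29_fe20538'."""
    return f"{day.isoformat()}_{git_sha[:7]}"


def blob_path(version_prefix: str, model_name: str) -> str:
    """Place each model's torch IR under its versioned folder."""
    return f"{version_prefix}/{model_name}/model.torch_ir.mlir"
```

Combining the run date with the short commit SHA makes each upload both chronologically sortable and traceable back to the exact source revision.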

To run turbine tank:
1. Build turbine like normal.
2. Run `python models/turbine_models/turbine_tank/run_tank.py`

I will add the corresponding nightly CI job once this lands.
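The caching behavior described above (skip the download when the local artifact is current, refresh when it is stale) might be sketched like this; the `VERSION` marker file is a hypothetical convention for illustration, not necessarily how the PR tracks freshness:

```python
from pathlib import Path


def needs_download(model_dir: Path, remote_version: str) -> bool:
    """Return True when the local cache is missing or out of date.

    The VERSION marker file is an illustrative assumption; the real
    artifacts may record their version differently.
    """
    marker = model_dir / "VERSION"
    if not marker.exists():
        return True  # nothing cached yet: download
    # cached but stale: caller should re-download and refresh the marker
    return marker.read_text().strip() != remote_version
```

A check like this avoids repeated downloads of multi-gigabyte IR artifacts while still picking up new uploads automatically.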

Member

@dan-garvey dan-garvey left a comment


I think the upload is something only a builder should do, so exposing it as a command line flag is probably not ideal. Additionally, make sure the storage is publicly READ-only and privately writable.

Also, tests

models/turbine_models/turbine_tank/run_models.py (outdated; review thread resolved)
@dan-garvey
Member

@monorimet please ensure the sd stuff meets the modularity requirements you have on the frontend

@saienduri
Contributor Author

> I think the upload is something only a builder should do, so exposing it as a command line flag is probably not ideal. Additionally, make sure the storage is publicly READ-only and privately writable.
>
> Also, tests

Upload is not a command line flag anymore. Storage is publicly read-only. Everything also runs as a test now.

models/turbine_models/custom_models/sd_inference/vae.py (outdated; review thread resolved)
models/turbine_models/custom_models/sd_inference/unet.py (outdated; review thread resolved)
models/turbine_models/custom_models/sd_inference/clip.py (outdated; review thread resolved)

```python
storage_account_key = "XSsr+KqxBLxXzRtFv3QbbdsAxdwDGe661Q1xY4ziMRtpCazN8W6HZePi6nwud5RNLC5Y7e410abg+AStyzmX1A=="
storage_account_name = "tankturbine"
connection_string = "DefaultEndpointsProtocol=https;AccountName=tankturbine;AccountKey=XSsr+KqxBLxXzRtFv3QbbdsAxdwDGe661Q1xY4ziMRtpCazN8W6HZePi6nwud5RNLC5Y7e410abg+AStyzmX1A==;EndpointSuffix=core.windows.net"
```
Member


Probably don't want this public for security reasons; use a GitHub secret or something.

Contributor Author


Good point. I made them environment variables that we can pass in using GitHub secrets in our GitHub Actions workflow.
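A minimal sketch of that change, assuming an environment variable name like `AZURE_CONNECTION_STRING` (the actual variable names in the PR may differ):

```python
import os


def get_connection_string() -> str:
    # Read credentials from the environment (exported locally by the
    # developer, or injected by GitHub Actions secrets in CI) instead
    # of hardcoding them in source control.
    conn = os.environ.get("AZURE_CONNECTION_STRING")
    if not conn:
        raise RuntimeError(
            "AZURE_CONNECTION_STRING is not set; export it locally or "
            "configure the corresponding GitHub Actions secret."
        )
    return conn
```

Failing fast with a clear message when the variable is missing keeps misconfigured CI runs from producing confusing downstream Azure errors.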



```python
def preprocess_input_image(model_name):
    # from datasets import load_dataset
```
Member


lots of dead code

Contributor Author

@saienduri saienduri Feb 21, 2024


All of this code is used in test_tank.py: the different get methods generate test input and the torch output we compare against, and return an HF model to build/run based on the model_name we are running e2e. All the preprocessing and helper methods are necessary.

@saienduri
Contributor Author

saienduri commented Feb 22, 2024

By the way, the test turbine models job failing here is not due to this code (it passes with my venv, which hasn't been updated). With the latest dependencies (specifically transformers), we hit this issue: `TypeError: llama_pos_shift_attention_forward() got an unexpected keyword argument 'cache_position'`. I manually ran the test models job on the main branch, and it fails as well: https://github.com/nod-ai/SHARK-Turbine/actions/runs/7996420641. Specifically, the LlamaRotaryEmbedding definition has changed, and the seq_len argument is deprecated and unused in forward. I can pin the transformers dependency to a specific version until we figure this out, if we want.

@dan-garvey
Member

lint this when you get a chance

@saienduri saienduri changed the title Turbine Tank Turbine Tank Turbine Changes Feb 28, 2024
Member

@dan-garvey dan-garvey left a comment


Small nit I probably missed on a previous pass; other than that, looks good to land.

```python
with open(f"{safe_name}.mlir", "w+") as f:
    f.write(module_str)
model_name_upload = hf_model_name.replace("/", "_")
model_name_upload = model_name_upload + "-vae-" + variant
```
Member


nit: use underscores instead of dashes for consistency here and elsewhere

Contributor Author


The underscore is only used to separate the org and model name; the rest is all '-' (e.g. CompVis_stable-diffusion-v1-4-vae-decode).
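That convention (underscore only between org and model, dashes everywhere else) could be captured in one small helper; the function name here is hypothetical, not taken from the PR:

```python
def artifact_name(hf_model_name: str, component: str, variant: str) -> str:
    """Build an upload name following the convention discussed above.

    'CompVis/stable-diffusion-v1-4', 'vae', 'decode'
        -> 'CompVis_stable-diffusion-v1-4-vae-decode'
    """
    base = hf_model_name.replace("/", "_")  # org_model: underscore join
    return f"{base}-{component}-{variant}"  # dashes for the rest
```

Centralizing the name construction would also make the naming convention trivial to change consistently later if the dash/underscore choice is revisited.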

@saienduri saienduri merged commit fe20538 into nod-ai:main Feb 29, 2024
3 of 4 checks passed